An O(log2N) parallel algorithm for output queuing

نویسندگان

  • Sadia Sharif
  • Adnan Aziz
  • Amit Prakash
چکیده

Output queued switches are appealing because they have better latency and throughput than input queued switches. However, they are difficult to build: a direct implementation of an N × N output-queued switch requires the switching fabric and the packet memories at the outputs to run at N times the line rate. Attempts have been made to implement output queuing with slow components, e.g., by having memories at both inputs and outputs running at twice the line rate. In these approaches, even though the packet memory speed is reduced, the scheduler time complexity is high — at least Ω(N). We show that idealized output queuing can be simulated in a shared memory architecture with (3N − 2) packet memories running at the line rate, using a scheduling algorithm whose time complexity is O(log N) on a parallel random access machine (PRAM). The number of processing elements and memory cells used by the PRAM are a small multiple of the size of the idealized switch.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Fast and Simple Randomized Parallel Algorithm for the Maximal Independent Set Problem

A simple parallel randomized algorithm to find a maximal independent set in a graph G = (V, E) on n vertices is presented. Its expected rmming time on a concurrent-read concurrent-write PRAM with 0( 1 E 1 d,,) processors is O(log n), where d,, denotes the maximum degree. On an exclusive-read exclusive-write PRAM with 0( 1 El) processors the algorithm runs in O(log2n). Previously, an O(log4n) de...

متن کامل

Fast and Efficient Parallel Solution of Dense Linear Systems

The most efficient previously known parallel algorithms for the inversion ofa nonsingular n x n matrix A or solving a linear system Ax = b over the rational numbers require O(log2n) time and M(n).~ processors [provided that M(n) processors suffice in order to multiply two n × n rational matrices in time O (log n)]. Furthermore, the known polylog arithmetic time algorithms for those problems are...

متن کامل

Serial and Parallel Algorithms for the Medial Axis Transform

We develop an O(n2) time serial algorithm to obtain the medial axis transform (MAT) of an n×n image. An O(logn) PRAM and an O(log2n) hypercube parallel algorithm for the MAT are also developed. Both of these use O(n2) processors. Two problems associated with the MAT are also studied. These are the area and perimeter reporting problem. We develop an O(logn) time hypercube algorithm for both of t...

متن کامل

String Editing on an SIMD Hypercube Multicomputer

SIMD hypercube algorithms to determine a minimum cost edit sequence to transform one string into another are developed. If the two strings are of length n, our algorithms take O √ p nlogn ______ + log2n time when n2p, 1 ≤ p ≤ n, processors are available. When p2 , nlogn ≤ p2 < n2 processors are available, the complexity of our algorithm is O p n1.5 ____ √ logn .

متن کامل

A Quasi-Random Approach to Matrix Spectral Analysis

Inspired by quantum computing algorithms for Linear Algebra problems [6, 14] we study how simulation on a classical computer of this type of “Phase Estimation algorithms” performs when we apply it to the Eigen-Problem of Hermitian matrices. The result is a completely new, efficient and stable, parallel algorithm to compute an approximate spectral decomposition of any Hermitian matrix. The algor...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002